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(54) TiUe: STABLE COMPOSmONS FOR NUCLEIC ACID AMPLIHCATION AND SEQUENCING 



(57) Abstract 

This invention is directed to compositions compiis- 
ing mixtures of reagents, including thermostable enzymes 
(e.g., thermostable DNA polymerases), buffers, cofactors 
and other components, suitable for immediate use in nu- 
cleic acid amplification or sequencing techniques without 
dilution or addition of further components. The composi- 
tions contain no stabilizing agents (e.g., glycerol or serum 
albumin) and unexpectedly maintain activity for extended 
periods of time upon storage at temperature above freez* 
ing. These compositions arc useful, alone or in the form 
of kits, for nucleic acid amplification (e.g., by the Poly- 
merase Chain Reaction, as exemplified in the Figure) aiid 
sequencing (c.g., by dideoxy or '^Sanger" sequencing), or 
for any procedure utilizing thermostable DNA polymerases 
in a variety of medical, forensic and agricultural applica- 
tions. In particular, the compositions and methods arc use- 
ful for amplifying and sequencing nucleic acid molecules 
that are larger than about 7 kilobases in size. 
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Stable Compositions for 
Nucleic Acid Amplification and Sequencing 

5 

FELD OF THE INVENTION 

This invention is in the fields of molecular and ceUular biology. The invention is particulariy 
directed to reagent compositions for use in techmques whereby nucleic acids (DNA or RNA) are 
10 amplified or sequenced, and to methods of amplifying and sequencing long nucleic add molecules, 
especially via the polym^ase chain reaction (PGR). 

BACKGROUND OF THE INVENTION 

15 DNA Polymerases 

During growth and reproduction of viruses and cellular organisms, the DNA containing the 
parental genetic information must be &ithfiilly copied and passed on to the progeny. This highly 
regulated process of DNA replication is carried out in vivo by a complex of enzymes and assodated 
protons and cofectors (Romberg, A, Science /i/: 1503- 1508, 1959; Hibner, U., and Alberts, B.M., 

20 Nature 255:300-305, 1980). The primary enzymes taking part in this process are the DNA 
polymerases, which catalyze the addition of deoxynucleoside triphosphate (dNTP) bases into the 
newly forming DNA strands. Together with other enzymes (e.g., helicases, ligases and ATPases), 
the DNA polymerases thus oisure rapid and relatively faithfiil replication of DNA in preparation for 
proliferation in prokaryotes, eukaryotes and >aruses. 

25 DNA polymerases are also used to manipulate DNA in vitro in a variety of molecular genetic 

techrdques. These enzymes have proven usefiil not only for in vitro DNA synthesis, but also for 
determining the nucleotide sequence (ic, "sequencing") of DNA fragments or genes. This latter 
application relies on the bjcX that, in addition to an activity vAich adds dNTPs to DNA in the 5' to 
3' direction (i.e., "polymerase" activity), many DNA polymerases also possess activities vAadi 

30 remove dNTPs in the 5' to 3' and/or the 3' to 5' direction (i.e., "cxonuclease" activity). DNA 
polymerases can also be used for sequendng via thrir incorporation of labdled chain-terminating 
agents ^ch as dideoxynudeoside triphosphates {See U.S. Patent Nos. 4,962,020; 5,173,41 1; and 
5.498,523). Thus the same enzyme, e.g., DNA polymerases I and HI from the bacterium Escherichia 
coli, DNA polymerase y from animal cells or DNA polymerase Srom bacteriophage T7 {See U.S 

35 Patent No. 4,795,699), may be used in vitro both for DNA synthesis, involving the elongation of the 
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DNA strand, and for DNA sequencing, involving either the synthesis or the digestion of the DNA 
strand. 

The dual activity of certain DNA polymerases is, however, a drawback for some in vitro 
applications. For example, the in vitro synthesis of an intact copy of a DNA fragment by the 
5 polymerase activity, an elongation process which proceeds in a 5* to 3' direction along the template 
DNA strand, is jeopardized by the exonuclease activities which may simultaneously or subsequently 
degrade the newly formed DNA. To overcome this technical problem, a fragment of £. coii DNA 
polymerase I lacking the 5' to 3* exonuclease activity (named the "Klenow fragment" after its 
discoverer) is often employed for in vitro DNA synthesis. The Klenow fragment provides for in vitro 
10 DNA synthesis at approximately the same rate as intact E, coli DNA polymerase 1, but the newly 
synthesized DNA molecules are less subject to enzymatic degradation and are correspondingly more 
stable. 

Unfortunately, the error rate (/.e., the rate at which incorrect dNTPs are incorporated into 
the new DNA strand by the enzyme) is somewhat higher for the Klenow fragment than for many 

15 other conmionly used DNA polymerases, including E. coli DNA polymerases I and II and the 
polymerases from bacteriophages T4 and T7 (Kunkel, T.A , et al, J, Bioi Chem. 259:1539-1545, 
1984; Tindall, K.R., and Kunkel, T.A., Biochemistry 27:6008-6013, 1988; Mattila, P., et al., NucL 
Acids Res, 19:4967-4973, 1991). Thus, until recently the rates of synthesis, degradation and error 
had to be weighed together when choosing which DNA polymerase to use for in vitro DNA 

20 synthesis. 

DNA Sequencing 

In general, two techniques have been traditionally used to sequence nucleic acids. In the first 
method, termed "Maxam and Gilbert sequencing" after its co-developers (Maxam, A M. and Gilbert, 

25 W.. Proc, Natl. Acad Sci. USA 7^:560-564. 1977), DNA is radiolabeled, divided into four samples 
and treated with chemicals that selectively destroy specific nucleotide bases in the DNA and cleave 
the molecule at the ates of damage. By separating the resultant fragments into discrete bands by gel 
dectorphoresis and exposing the gel to X-ray film, the sequence of the original DNA molecule can 
be read from the film. This technique has been used to determine the sequences of certain complex 

30 DNA molecules, including the primate virus SV40 (Fiers, W., et al. Nature 273:1 13-120, 1978; 
Reddy, V.B., et aL, Science 200:494-502, 1978) and the bacterial plasmid pBR322 (Sutcliff^e, G., 
Cold Spring Harbor Symp, Quant. BioL -/5: 77-90, 1979) 
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An alternative technique for sequencing, named "Sanger sequencing" after its developer 
(Sanger, F., and Coulson, A.R., J, Mol Biol 9^:444-448, 1975), is more commonly employed. This 
method uses the DNA-synthesizing activity of DNA polymerases which, when combined with 
mixtures of reaction-terminating dideoxynucleoside triphosphates (Sanger, F., et al^ Proa Nail 
5 Acad ScL USA 7^:5463-5467, 1977) and a short primer (dther of which may be detectably labeled), 
gives rise to a series of newly synthesized DNA fragments specifically terminated at one of the four 
dideoxy bases. These fragments are then resolved by gel electrophoresis and the sequence 
determined as described for Maxam and Gilbert sequencing above. By carrying out four separate 
reactions (once vwth each ddNTP), the sequences of even fairly complex DNA molecules may rapidly 

10 be determined (Sanger, F., e/a/., Nature 265:678-695, 1977; Barnes, W., Meik Enzymol /J2:538- 
556, 1987). While Sanger sequencing usually employs E, coli or T7 DNA polymerase (U.S. Patent 
No. 4,795,699), recent modifications of this technique using T7 polymerase mutants allow 
sequencing to be accomplished using a single sequencing reaction containing all four chain- 
terminating ddNTPs at difiFerent concentrations (U.S. Patent Nos. 4,962,020 and 5,173,41 1). Further 

15 modifications to the technique, to reduce or eliminate the buildup of reacrion-poisoning 
pyrophosphate in the reaction mixtures, have also been described (U.S. Patent No. 5,498,523). 

The Polymerase Chain Reaction 

Soon after their identification and chaiactmzation, it was recognized that the activities of the 

20 various enzymes and cofactors involved in DNA synthesis could be exploited in vitro to dramatically 
increase the concentration o^ or "amplify," one or more selected DNA sequences. For many 
medical, diagnostic and forenac applications, amplification of a particular DNA sequence is essential 
to allow its detection in. or isolation from, a sample in which it is present in very low amounts. More 
recently, in vitro amplification of specific genes has provided powerful and less costly means to 

25 facilitate the production of therapeutic proteins by molecular biological techniques, and may have 
applications in genetic therapy as well. 

While a variety of nucleic acid amplification processes has been described, the most 
commonly employed is the Polymerase Chain Reaction (PCR) technique disclosed in U.S. Patent 
Nos. 4,683,195 and 4,683,202. In this process, a sample containing the nucleic acid sequence to be 

30 amplified (the "target sequence") is first heated to denature or separate the two strands of the nucleic 
add. The sample is then cooled and mixed with specific oligonucleotide primers which hybridize to 
the target sequence. Following this hybridization, DNA polymerase in a buffered aqueous solution 
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is added to the sample, along with a mixture of the dNTPs that are linked by the polymerase to the 
replicating nucleic acid strand. After allowing polymerization to proceed to completion, the products 
are again heat-denatured, subjected to another round of primer hybridization and polymerase 
replication, and this process repeated any number of times. Since each nucleic acid product of a 
5 given cycle of this process serves as a template for production of two new nucleic acid molecules 
(one from each parent strand), the PCR process results in an exponential increase in the 
concentration of the target sequence. Thus, in a well-controlled, high-fidelity PCR process, as few 
as 20 cycles can result in an over one million-fold amplification of the target nucleic acid sequence 
(See U.S. Patent Nos. 4,683,195 and 4,683,202). 

10 

Thermostable DNA Polymerases 
Overview 

Initially, the DNA polymerases of choice for use in DNA sequencing or PCR were £. coli 
DNA polymerase 1, the Klenow fragment, or T4 or T7 polymerases owing to their ease of isolation 

1 5 and well-characterized activities. However, the use of these enzymes necessitated their addition prior 
to the start of each sequencing or PCR cycle, due to their thermolability at the temperatures used to 
denature the DNA strands in the initial steps of the processes (typically 70*" to 95 °C) (Saiki, R.K., 
etal. Science 230:1350-1354, 1985;Mullis.K.B., andFaloona, F.A., A/e//t EnzymoL 755:335-350, 
1987; U.S. Patent No. 4,795,699). This need for the addition of fresh enzyme at the beginning of 

20 each cycle increased the amount of time required for these processes, and increased the risk of 
operator error and contamination of the samples during reagent introduction, often leading to 
undesirable results. 

These difficulties were partially overcome by the use of thermostable DNA polymerases in 
the PCR process (Saiki, R.K., et aL, Science 259:487-491, 1988). The thermostable DNA 

25 polymerase most commonly used in PCR is Tag polymerase, isolated from the thermophilic 
bacterium Thermus aquaiicus (Saiki etal,, 1988, M; US. Patent Nos. 4,889,818 and 4,965,188). 
Tag polymerase functions optimally at temperatures of 70-80**C, and is able to maintain substantial 
activity upon repeated exposure to temperatures of 92** -95 "C as are often used in the initial steps 
of PCR (Gelfand, D.H., and White, T.J., in: PCR Protocols: A Guide to Methods and Applications, 

30 Innis, MA, et al, eds.. Academic Press, pp. 129-141, 1989; Bej, A.K.. and Mahbubani, M,H., in: 
PCR Technology: Current Innovations, Griffin, H.G., and Griffin, A.M., eds., CRC Press, pp. 219- 
237, 1994). 
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The use of Tag polymerase in PCR eliminated the need to add fresh enzyme to the reaction 
mix prior to each PCR cycle. Instead, a quantity of Tag polymerase sufficient to catalyze DNA 
polymerization over the desired number of cycles can be mixed with the other components prior to 
the initiation of the first PCR cycle, and the enzyme continues to function throughout the repetitive 
5 cycles of inaeased and decreased temperatures. The use of Tag polymerase has also fecilitated the 
automation of the PCR process (Gelfand and White, Id), thereby at once dramatically reducing time 
constraints and the risks of operator error and sample contamination that are problematic with 
thermolabile polymerases. Currently, most PCR amplification of nucleic acids for industrial and 
academic applications is performed using Tag polymerase and automated thermal cycling 
10 instrumentation. 

In addition to Tag polymerase, other thermostable polymerases have found similar application 
in PCR (Bej and Mahbubani, Id). Particularly useful as substitutes for Tag polymerase in PCR are 
polymerases isolated from the thermophilic bacteria Thermus thermophilus (Tth polymerase), 
Thermococcus litoralis (77/ or VENT™ polymerase), Pyrococcus furiosus (Pfii or DEEPVENT 

15 polymerase), Pyrococcus woosii (Pwo polymerase) and other Pyrococcus species. Bacillus 
sterothermophilus (Bst polymerase), Sulfolobus acidocaldarius (Sac polymerase), Thermoplasma 
acidophilum {Tac polymerase), Thermus flavus {TfUTub polymerase), Thermus ruber (Tru 
polymerase), Thermus brockianus (DYNAZYME™ polymerase), Thermotoga neapolitana {Tne 
polymerase; See WO 96/10640), Thermotoga maritima {Tma polymerase; See U. S. Patent No. 

20 5,374,553) and other species of the Thermotoga genus {Tsp polymerase) and Methanobactehum 
thermoautotrophicum (Mth polymerase). While each of these polymerases is useful for particular 
appUcations {See Bg and Mahbubani, Id., p. 222), Tag polymerase is still by far the most commonly 
used polymerase in PCR. 

Thermostable polymerases have also found application in DNA sequencing techniques, 

25 particularly in autonaated methods of dideoxy sequencing such as "cycle sequencing." These 
approaches resemble PCR in most respects except that, in place of dNTPs, automated DNA 
sequencing uses ddNTPs which allow determination of the sequence of the template DNA as 
described above. Use of higher denaturation temperatures in automated sequencing also improves 
sequencing efficiency (/.e., fewer misincorporations occur) and allows the sequencing of templates 

30 that are GC-rich or contain significant secondary structure (such as supercoiling). 

The use of thennolabile DNA polymerases such as £. coli or T7 DNA polymerases in these 
approaches, however, is subject to the same limitations described above for their use in PCR. 
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Accordingly, automated methods of DNA sequencing utilizing higher temperatures have increasingly 
employed thermostable DNA polymerases, the most commonly used of which is, as for PCR, Taq 
polymerase. 

5 Technical Limitations 

The use of Taq and other thermostable polymerases in sequencing and PCR is not, however, 
without drawback. For example, the error rate for Taq polymerase is substantially higher {/.e., the 
final product is of "lower fidelity") than that for most of the thermolabile DNA polymerases, 
including the Klenow fi^agment of E. coii DNA polymerase 1 (Tindall and Kunkel, Id), averaging 

10 about ICh- misincorporations per base pair per cycle. In addition, Taq polymerase is only useful for 
amplifying relatively short stretches of DNA (maximum length on the order of 5-6 kilobases; Barnes, 
W.M., Proc. NaiL Acad Set USA 97:2216-2220, 1994), thus precluding its use in PCR amplification 
of large genes and whole genomes as is necessary in many current applications. 

These technical limitations are apparently related: it has been theorized that the Taq 

1 5 polymerase PCR kngfh limitation is due to the low efficiency of elongation of the newly synthesized 
DNA strands at the sites of incorporation of mismatched bases in the parent strands (Barnes, Id), 
Further contributing to this difficulty is the absence in Taq polymerase of a 3* to 5* exonuclease 
activity, which in other polymerases acts in a "proofi^eading" capacity to correct these mismatches 
and reduce the error rate (Bej and Mahbubani, Id). The 5* to 3' exonuclease activity present in most 

20 thermostable DNA polymerases can also degrade the 5' ends of the oligonucleotide primers (also a 
complication with the 3* to 5' exonuclease activity which can degrade the 3' ends of the primers), 
yielding undesirable results due to an early termination of the PCR process (See WO 92/06200; 
Barnes, Id), 

In sequencing reactions, Taq polymaase is subject to a limitation shared by R coli 
25 polymerase I and the K1qk)w firagment. These enzymes each are "discriminatory," meaning that they 
preferentially incorporate dNTPs over ddNTPs into newly synthesized DNA. Thus, to use Taq 
polymerase in automated sequencing reactions, relatively high concentrations of ddNTPs must be 
maintained in the reaction mixtures, to kinetically favor ddNTP incorporation by the enzyme. This 
need for high levels of ddNTPs can be prohibitively expensive, particulariy when large DNA 
30 fi"agments are being sequenced. 

As another technical limitation, the DNA polymerases have heretofore been maintained in 
highly concentrated stock solutions in storage buffers containing glycerol, bovine serum albumin 
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and/or other stabilizing agents and stored at -20*C or lower (See, e.g., WO 92/06188; U.S. Patent 
No. 5,436,149). The conventional understanding in the field has been that the enzymes would rapidly 
lose activity at more dilute working concentrations (as do many bioactive proteins) and in solutions 
vwthout glycerol or other stabilizing agents. Moreover, the solutions of enzymes had to be mixed 
5 with dNTPs or ddNTPs, cofactors (such as Mg^) and one or more detergents immediately prior to 
use in the sequencing or PCR processes, as it was believed that premixture and storage of these 
solutions would also deleteriously affect their stability. In addition, dNTPs have traditionally been 
stored at temperatures below -20*C and have also been thought to be unstable if stored otherwise 
(Maniatis, T., etai. Molecular Cloning, A Laboratory Manual, CRC Press, 1992). Together, these 
10 limitations have made the use of compositions containing thermostable DNA polymerases in DNA 
sequencing and PCR more costly and time-consuming than would be desired. 

Overcoming Technical Limitations 

Several approaches have been undertaken to attempt to surmount these technical difficulties. 

15 The results of DNA sequencing methodologies, for example, have been improved by the use of 
mutant Tag enzymes such as UTaq (WO 92/06188) lacking the 5' to 3' exonuclease activity. 
Improved sequencing results have also been obtained by including in the reaction mixture an agent 
such as pyrophosphatase which breaks down the pyrophosphate that can be formed during dideoxy 
sequwicing reactions (See U.S. Patent Number 5,498,523). To overcome the discrimination between 

20 ddNTPs and dNTPs, some investigators have used T7 DNA polymerase in sequencing, as this 
CTzyme is "nondiscrintinatory,*' meaning that it incorporates ddNTPs at approximately the same rate 
as dNTPs (See U.S. Patent No. 4,795,699). Alternatively, mutants of DNA polymerase from a 
variety of organisms (e,g, R coli) which are nondiscriminatory have also been described; see 
copending U.S. Patent Application No. 08/525,057 of Deb K. Chatteijee, filed September 8, 1995, 

25 entitled "Mutant DNA Polymerases and Use Therof," the disclosure of which is expressly 
incorporated herein by reference. However, as described above, both £ coli and T7 DNA 
polymerases are thermolabile, so their use in automated sequencing requires addition of fresh enzyme 
at the beginning of each cycle. 

More recently, mutations in Tne polymerase from Thermotoga neapolitana have been 

30 described, which overcome these limiutions (WO 96/10640), One of these mutations, in which a 
phenylalanine residue at amino acid position number 730 in the wildtype protein (SEQ ID N0:1) is 
replaced with a tyrosine residue, results in a mutant Tne polymerase (SEQ ID N0:2) which is both 
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thermostable owe/ nondiscriminatory. This mutant Tm polymerase thus provides a solution to both 
the problems of thermolability and ddNTP discrimination found in other enzymes used in automated 
DNA sequencing. See also the co-pending U.S. Patent Application of A. John Hughes and Deb K. 
Chatteijee, entitled "Cloned DNA Polymo-ases from Thermotoga and Mutants Therof," filed on even 

5 day herewith, which is incorporated by reference herein in its entirety. 

In PGR applications, the low fidelity of Tii^-produced PCR products has been alleviated to 
some extent by the use of DNA polymerase which contains the proofreading 3' to 5' exonuclease 
activity lacking in Taq polymerase (Lundberg, K.S-, et aL, Gem 108. 1-6, 1991). Other thermostable 
DNA polymerases, including 77/ATENT™ (Bej and Mahbubani, Id) and DEEPVENT™ (Flaman, 

10 }.-M.,eiaL,NucL Acids Res, 22(1 5)3259-3260, 1994) have also been shown to improve the fidelity 
of PCR products. 

As a means of overcoming this length limitation, mutant enzymes lacking the 5' to 3* 
exonuclease activity have been prepared, including N-terminal deletion mutants of Taq polymerase 
that are analogous to the Klenow fragment of K coli DNA polymerase I Several of these mutant 

15 enzymes, including Klentaq-1 and Klentaq-278 (Barnes, W.M., Gene 772:29-35, 1992; U.S. Patent 
No. 5,436,149), the TaqStoSd fragment (Lawyer, F.C., eiaL, PCRMeth. Appl 2:275-287, 1993), 
and mutants of other thermostable DNA polymerases lacking the 5' to 3' exonuclease activity (e.^., 
those disclosed in WO 92/06200), have been shown to provide increasingly stable PCR products and 
primers. However, as these enzymes also lack the 3' to 5' proofreading activity, their use subjects 

20 the PCR process to the increased error rates described above. Thus, even with these mutant 
enzymes, high-fidelity PCR amplification of DNA fragments larger than 5-6 kilobases has proven 
exceedingly diSBcuh. 

By combining spedfic quantities of several en^mes, however, high-fidelity PCR amplification 
of large DNA sequences has been achieved. For example, use of a combination of a high 

25 concentration of a thermostable DNA polymerase lacking the 3' to 5' exonuclease activity (e.^., 
Klentaq278) and a low concentration of a thermostable DNA polymerase exhibiting the 3' to 5' 
exonuclease activity (e.g., /^/i/ZDEEPVENT or THN^KY^) provides for amplification to high 
concentrations of DNA sequences of at least 35 kilobases in length with significantly improved 
fidelity (Barnes, Id\ U.S. Patent No. 5,436,149). Similar resuhs have been obtained with mixtures 

30 of Tth polymerase and low levels of thermostable polymerases from Pyrococcus, Tli or Tma (U.S. 
Patent No. 5,512.462). Apparently, the low level of 3' to 5' exonuclease activity is suflScient to 
remove any mismatched bases incorporated by the majority polymerase, but is insufficient to 
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significantly degrade the primers. While this approach has heretofore been applied to simple DNA 
sequences such as those from bacteriophage X, it may prove applicable to larger and more complex 
sequences as well, including those of the genomes of bacteria, yeast, plants and animals. 

Despite these successes in overcoming the ddNTP discrimination, length and fidelity 
limitations, however, compositions comprising the reagents necessary for DNA sequencing or PGR 
capable of extended storage above freezing at working concentrations, without stabilizing agents, 
have not heretofore been reported. Thus, the time and economic constraints to the use of solutions 
of thermostable enzymes in most ^plications have yet to be overcome. 

BRIEF SUMMARY OF THE INVENTION 

The present invention overcomes these temporal and economic limitations of previously 
available reagent compositions used in nucleic acid amplification and sequencing methods. 
Specifically, the invention is directed to compositions comprising mixtures of reagents at working 
concentrations suitable for use with or without dilution and maintaining activity upon storage for an 
extended time, said mixtures consisting essentiaUy of at least one thermostable enzyme and at least 
one buffer salt. The invention fiirther provides such compositions for use in nucleic add 
amplification fiirther comprising at least one deoxynucleoside triphosphate, magnesium salts and at 
least one nonionic detergent, wherein the thermostable enzyme is at least one thermostable DNA 
polymerase. In another embodiment, the invention provides such compositions for use in nucleic acid 
sequencing fiirther comprising at least one deoxynucleoside triphosphate, at least one 
dideoxynucleoside triphosphate, magnesium salts and at least one nonionic deteigent, wherein the 
thermostable enzyme is at least one thermostable DNA polymerase. More specifically, the invention 
is directed to such compositions wherein the thermostable DNA polymerase is Tag, Tne, Tma, Pfu, 
Pwo or m DNA polymerase, or mutants thereof preferably at concentrations of about 0. 1-200 units 
per milliliter, about 0. 1-50 units per milliliter, about 0. 1-40 units per miUUiter, about 0. 1-36 units per 
milliliter, about 0.1-34 units per milliliter, about 0.1-32 units per milliUter. about 0.1-30 units per 
milliliter, or about 0.1-20 units per milliliter, and most preferably at concentrations of about 20 units 
per milliliter. The invention is also directed to such compositions that fiirther comprise VENT or 
DEEPVENT™ DNA polymerase, preferably at concentrations of about 0.0002-200 units per 
milliliter, about 0.002-100 units per milliliter, about 0.002-20 miits per miUiUter. about 0.002-2.0 
units per milliliter, about 0.002-1.6 units per milliliter, about 0.002-0.8 units per milliliter, about 
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0.002-0.4 units per milliliter, or about 0.002-0.2 units per milliliter, most preferably at concentrations 
of about 0.40 units per milliliter. The invention is further directed to kits for DNA amplification or 
sequencing, said kits comprising a carrier means, such as a box, carton, tube or the like, having in 
close confinement therdn one or more container means, such as vials, tubes, ampules, bottles or the 
like, wherein a first container means contains a composition comprising a mbcture of reagents at 
working concentrations suitable for use without dilution and maintaining activity upon storage for 
extended time, said mixture consisting essentially of at least one thermostable DNA polymerase, 
buffer salts, magnesium salts and at least one nonionic detergent. The first container means may also 
contain a mixture of dNTPs (for PCR applications) or ddNTPs (for sequendng applications). 
Alternatively, the dNTPs or ddNTPs may be included in a second container means also closely 
confined within the carrier means of the kit. 

The invention is also directed to methods of amplifying or sequ^dng a nucleic acid molecule, 
comprising contacting the nucleic acid molecule to be amplified or sequenced, which is preferably 
larger than about 4-8 kilobases in size, more preferably larger than about 5-7 kilobases in size and 
most preferably larger than about 7 kilobases in size, with the compositions of the invention. The 
invention also provides nucleic acid molecules amplified by these methods. 

The present invention is also directed more generally to compositions containing thermostable 
proteins or enzymes useful in molecular biology. These compositions comprise mixtures of reagents 
at working concentrations suitable for use with or without dilution which maintain the enzyme or 
protein activity upon storage for an extended time. The compositions of this aspect of the invention 
comprise at least one thermostable enzyme and at least one buffer salt. The thermostable enzymes 
in these compositions include, but are not limited to, polymerases, restriction enzymes, alkaline 
phosphatases, reverse transcriptases, ligases, nucleases, pyrophosphatases, DNAses, RNAses, 
exonucleases, RNAse inhibitors, kinases, topoisomerases, guanylyltransferases and glycosylases (e.^., 
uracil DNA glycosylase). 

The invention is further directed to kits for conducting a procedure associated with the 
thermostable enzymes or proteins (restriction enzymes, phosphatases, etc.), the kits comprising a 
container means such as a box, carton, tube and the like, having in close confinement therein one or 
more container means, such as a vial, tube, ampule, bottle or the like, wherein a first container means 
contains a composition comprising a mixture of reagents at working concentrations suitable for use 
with or without dihiUon. The reagents in the first container means include at least one thermostable 
enzyme or protein and at least one buffer salt. 
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The compositions of the invention have unexpectedly been found to maintain enzyme activity 
for extended periods of time compared to conventional compositions. For example, the present 
compositions maintain enzyme activity for at least four weeks when stored at ambient temperature 
(about 20°-25*'C), for at least one year at about 4**C and for at least two years at about -20**C. 
5 These stable, ready-to-use reagent compositions are useful for nucleic add amplification 

(including the Polymerase Chain Reaction or PGR) and sequencing (including dideoxy sequencing), 
or for any procedure using thermostable DN A polymerases or other enzymes (restriction enzymes, 
phosphatases, kinases, etc.) in fields such as medical therapeutics and diagnostics, forensics and 
agricultural science. 

10 Other features, advantages and applications of the present invention will be apparent to those 

skilled in the art from the following description of the preferred embodiments thereof, and from the 
claims. 



BRIEF DESCRIPTION OF THE DRAWINGS 

15 

Figure 1 is a photograph of an agarose gel (visualized by ethidium bromide fluorescence 
under ultraviolet illumination) of PCR amplification of a 1 .3 kilobase human genomic DNA fragment 
from 10 nanograms of template using 15 different reagent compositions (corresponding to samples 
10-24 in Table 1) stored as indicated in Table 1, and a freshly made control sample. Lane mariced 
20 "M" contains markers indicating amount of DNA loaded; bands correspond to (from top to bottom) 
100 nanograms, 60 nanograms, 40 nanograms and 20 nanograms of DNA mass markers. 



Figure 2 is a photograph of an agarose gel (visualized by ethidium bromide fluorescence 
under ultraviolet ilhimination) of PCR amplification of a 4. 1 kilobase human genomic DNA fragment 
using the amounts indicated Cm nanograms) of template for each amplification reaction. The samples 
on the upper portion of the gel were amplified with a nucleic acid amplification composition stored 
for 10 weeks at 4X. while those on the lower portion of the gel were amplified with a composition 
stored for 10 weeks at ambient temperature (about 20^25^). Leftmost lanes in each portion are 
markers as indicated in Figure 1. 



30 
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DETAILED DESCRIPTION OF THE INVENTION 



Deflnitions 

Throughout this disclosure, various terms that are generally understood by those of routine 
5 skill in the art are used. Certain terms as used herein, however, have specific meanings for the 
purposes of the present invention. The term "dNTP" (plural "dNTPs") generically refers to the 
deoxynucleoside triphosphates {e.g., dATP, dCTP. dGTP, dTTP, dUTP, dITP, T-deaza-dGTP, 
adATP. adTTP. adGTP and odCTP), and the term "ddNTP" (plural "ddNTPs") to their dideoxy 
counterparts, that are incorporated by polymerase enzymes into newly synthesized nucleic acids. The 

1 0 term "unit" as used herein refers to the activity of an enzyme. When referring to a thermostable DNA 
polymerase, one unit of activity is the amount of enzyme that will incorporate 10 nanomoles of 
dNTPs into add-insoluble material (i.e., DNA or RNA) in 30 minutes under standard primed DNA 
synthesis conditions. "Working concentration" is used herein to mean the concentration of a reagent 
that is at or near the optimal concentration used in a solution to perform a particular function (such 

15 as amplification, sequencing or digestion of nucleic acids). The term "detergent" as used herein 
rrfers to a nonionic surfactant such as TRITON X-100®, Nonidet P-40 (NP-40), Tween 20 or Brij 
35. The terms "stable" and "stability" as used herein generally mean the retention by an enzyme of 
at least 70%, preferably at least 80%, and most preferably at least 90%, of the original enzymatic 
activity (in units) after the enzyme or composition containing the enzyme has been stored for at least 

20 four weeks at a temperature of about 20-25 °C, at least one year at a temperature of about A'C or 
at least 2 years at a temperature of -20''C. 

Overview 

The present invention provides, in a first preferred embodiment, compositions comprising 
25 mixtures of at least one th«Tnostable enzyme (e.g., a thermostable DNA polymerase, restriction 
enzyme, etc.), at least one buffer salt, and other reagents necessary for carrying out the procedure 
associated with the enzyme(s) (e.g., deoxynucleoside triphosphates (dNTPs) for ampDfication of 
nu<fcic acids, dNTPs and dideojqmucleoside triphosphates (ddNTPs) for sequencing of nucleic acids, 
etc.). These compositions contain no stabilizing compounds (e.g. glycerol, serum albumin or 
30 gdatin) that have been traditionally included in stock reagent solutions, and exhibit increased stability 
(measured as maintenance of enzyme activity) even upon storage at temperatures above fi-eezing. 
Furthermore, the invention provides these reagent compositions in ready-to-use concentrations. 
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obviating the time-consuming dilution and pre-mixing steps necessary with previously available 
solutions. Unexpectedly, even at these diluted concentrations the reagent compositions are stable 
for extended periods of time at temperatures ranging from ambient (about 20-25 "C) to about -lO'C 
In additional preferred embodiments, the present invention provides these ready-to-use 
5 compositions in the form of kits that are suitable for immediate use to cany out the procedure 
associated with the enzyme(s) (c.^., nucleic acid amplification or sequencing in the case of DNA 
polymerases). These kits are also stable for extended periods of time at temperatures ranging from 
ambient (about 20-25''C) to -70'C. 

1 0 Sources of Reagents 

The compositions of the present invention may be formed by mixing the component reagents 
at the concentrations described below. The components for making the ready-to-use compositions 
can be obtained from, for example. GIBCO/LTI (Gaithersburg, Maryland). 

15 Thermostable Enzymes 

The thermostable enzymes {e.g., DNA polymerases, restriction enzymes, phosphatases, etc.) 
used in the present invention may be isolated from natural or recombinant sources, by techniques that 
are well-known in the art {See Bej and Mahbubani. Id; WO 92/06200; WO 96/10640), from a 
variety of tiiermophilic bacteria that are available commercially (for example, from American Type 

20 Culture CoUection. Rockville. Maryland) or may be obtained by recombinant DNA techniques (WO 
96/10640). Suitable for use as sources of thermostable enzymes or the genes thereof for expression 
in recombinant systems are the tiienmophilic bacteria Thermus thermophUus, Thermococcus litoralis, 
Pyrococcusfuriosus. Pyrococcus woosii and other species of the Pyrococcus genus. Bacillus 
sterothermophilus, Sulfolobus addocaldarius, Thermoplasma acidopMum, Thermus fla»is, 

25 Thermus ruber, Thermus brochams, Thermotoga neapolitaru,, Thermotoga maritima and other 
species of the Thermotoga genus, and Methanobacterium thermoautotrophicum, and mutants 
thereof It is to be understood, however, that thermostable enzymes from other organisms may also 
be used in die present invention witiKHrt departing from the scope or preferred embodiments thereof 
As an alternative to isolation, thermostable enzymes (e.g., DNA polymerases) are available 

30 commercially from, for example, GIBCO/LTI (Gaithersburg, Maryland), New England Biolabs 
(Beverly. Massachusetts), Fimizymes Oy (Espoo. Fmland) and Perkin Elmer Cetus (Norwalk, 
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Connecticut). Once obtained, the purified enzymes may be placed into solution at working 
concentrations and stored according to the methods of the present invention. 

dNTPs 

5 The dNTP components of the present compositions serve as the "building blocks" for newly 

synthesized nucleic acids, bdng incorporated therein by the action of the polymerases. These dNTPs 
- deoxyadenosine triphosphate (dATP), deoxycytosine triphosphate (dCTP), deoxyguanosine 
triphosphate (dGTP), deoxythymidine triphosphate (dTTP), and for some applications deoxyuridine 
triphosphate (dUTP) and deoxyinosine triphosphate (dITP), a-thio-d ATP and 7-deaza-dGTP - are 

1 0 available commercially fi-om sources including GIBCO/LTI (Gaithersburg, Maryland), New England 
Biolabs (Beverly, Massachusetts) and Sigma Chemical Company (Saint Louis, Missouri). The 
dNTPs may be unlabeled, or they may be detect ably labeled by coupling them by methods known in 
the art with radioisotopes (e.g., H , »C, «P or »S), vitamins (e.g., biotin), fluorescent moieties (e.g, 
fluorescdn, rhodamine, Texas Red, or phycoerythrin) or other detection agents. Labeled dNTPs may 

1 5 also be obtained commercially, for example from GIBCO/LTI (Gaithersburg, Maryland) or Sigma 
Chemical Company (Saint Louis, Missouri). Once obtained, the dNTPs may be placed into solution 
at working concentrations and stored according to the methods of the present invention. 

ddNTPs 

20 The ddNTP components of the present compoations serve as the "terminating agents" in the 

dideoxy nucleic acid sequencing methodologies, being incorporated into newly synthesized nucleic 
adds by the action of the polymerases. These ddNTPs ~ dideoxyadenosine triphosphate (ddATP), 
dideoxycytosine triphosphate (ddCTP), dideoxyguanosine triphosphate (ddGTP), dideoxythymidine 
triphosphate (ddTTP), and for some applications dideoxyuridine triphosphate (ddUTP) and 

25 dideoxyinosine triphosphate (ddlTP) - are available conunercially firom sources including 
GIBCO/LTI (Gaithersburg, Maryland), New England Biolabs (Beverly. Massachusetts) and Sigma 
Chemical Company (Saint Louis, Missouri). The ddNTPs may be unlabeled, or they may be 
detectably labeled by coupling them by methods known in the art with radioisotopes (e.g., 'H, "C, 
"P or "S), vitamins (e,g,, biotin), fluorescent moieties (e.g., fluorescein, rhodamine, Texas Red, or 

30 phycoerythrin) or other detection agents. Labeled ddNTPs may also be obtained conunercially, for 
example from GIBCO/LTI (Gaithersburg, Maryland) or Sigma Chemical Company (Saint Louis, 
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Missouri). Once obtained, the ddNTPs may be placed into solution at working concentrations and 
stored according to the methods of the present invention. 

Buffers/Saits 

5 All buffers and cofactor salts comprising the compositions of the present invention, and 

concentrated stock solutions thereof, are available from a variety of commercial sources including 
GIBCO/LTI (Gaithersburg, Maryland) and Sigma Chemical Company (Saint Louis, Missouri). 
Particularly preferred buffers for use in forming the present compositions are the sulfate, 
hydrochloride, phosphate or free acid forms of tris-(hydroxymethyl)aminomethane (TRIS®). 
10 although alternative buffers of the same approximate ionic strength and pKa as TRIS® may be used 
with equivalent results. In addition to the buffer salts, cofactor salts such as those of potassium 
(preferably potassium chloride) and magnesium (preferably magnesium chloride or sulfate) are 
included in the compositions. Once obtained, the buffers and cofactor salts may be placed into 
solution at working concentrations and stored according to the methods of the present invention. 

15 

DetergetUs 

At least one detergent may be included as a component of the present composiUons, to 
provide for both increased stability and activity of the component enzymes. Nonionic detergents are 
preferred, to maintain a balanced ionic strength and prevent chelation of cofactors and aggregation 

20 or inactivation of proteins. Particularly preferred as detergents are TRITON X-IOO® , Brij 35, 
Tween 20 and Nonidet P-4G (NP-40), although other nonionic surfactants and mixtures thereof may 
also be used in the present compositions. These detergents are available commercially from sources 
such as Sigma Chemical Company (Saint Louis, Missouri), usually as concentrated aqueous solutions 
or in powder form. Once obtained, the detergents may be placed into solution at working 

25 concentrations and stored according to the methods of the present invention. 

Formulating the Reagent Compositions 

Once the reagent components have been obtained, they are mixed at working concentrations 
to form a solution suitable for immediate use with or without diluUon or addition of further reagents. 
30 The water used in the formulations of the present invention is preferably distilled, deionized and 
sterile filtered (through a 0.1-0.2 micrometer filter), and is free of contamination by DNase and 
RNase enzymes. Such water is available commercially, for example from Sigma Chemical Company 



wo 98/06736 



-16- 



PCrAJS97/14266 



(Saint Louis, Missouri), or may be made as needed according to methods well known to those skilled 
in the art. 

Although the components of the present compositions may be admixed in any sequence, it 
is often preferable to first dissolve the bufFer(s) and cofactor salts in water and to adjust the pH of 

5 the solution prior to addition of the remaining components. In this way, the pH-sensitive 
components (particularly the enzymes, ddNTPs and dNTPs) will be less subject to acid- or alkaline- 
hydrolysis during formulation. 

To formulate the buffered salts solution, a buffer salt which is preferably a sail of 
tris(hydroxymethyl)aminomethane (TRIS®), and most preferably the hydrochloride salt thereof, is 

10 combined with a suflBdent quantity of water to yield a solution having a TRIS(E) concentration of 5- 
150 millimolar, preferably 10-60 millimolar, and most preferably about 20-60 millimolar. To this 
solution, a salt of magnesium (preferably either the chloride or sulfate salt thereof) may be added to 
provide a working concentration thereof of 1-10 millimolar, preferably 1.5-5 millimolar, and most 
preferably about 1.5-2 millimolar. A salt of potassium (most preferably potassium chloride) may also 

1 5 be added to the solution, at a working concentration of 1 0- 1 00 millimolar and most preferably about ^ 
50 nullimolar. An ammonium salt, for example ammonium sulfate, may also be added to the mixture, 
at a working concentration of 2-50 millimolar, preferably 10-30 millimolar and most preferably 18 
millimolar. Combinations of ammonium sulfate and potassium chloride (or other salts) may also be 
used in formulating the compositions of the present invention. A small amount of a salt of 

20 ethylenediaminetetraacetate (EDTA) may also be added (preferably about 0. 1 millimolar), although 
inclusion of EDTA does not appear to be essential to the funcUon or stability of the compositions 
of the present invention. After addition of all buffers and salts, this buffered salt solution is mixed 
wdl until all salts are dissolved, and the pH is adjusted using methods known in the art to a pH value 
of 7.4 to 9.2, preferably 8.0 to 9.0, and most preferably about 8.3 for compositions to be used in 

25 amplification or sequencing of nucleotide fragments up to about 5-6 kilobases in size (hereinafter 
referred to as "standard composltons"), and about 8.9 for compositions to be used for amplification 
or sequencing of nucleotide fragments larger than about 5-6 kilobases in size (hereinafter referred 
to as "large sequence compositions"). 

To the buffered salt solution, the remaining components of the present composition are 

30 added. It is wdl known in the field that the addition of one or more detergents to an aqueous bufifer 
will aid in the subsequent solubilization of added proteins. Accordingly, at least one nonionic 
detergent such as TRITON X-100(g> (preferably at a working concentration of 0.1-1%), Brij 35 



wo 98/06736 



-17- 



PCTAJSm4266 



(preferably at a concentration of 0.01-1% and most preferably of about 0.1%) or Nonidet P.40 
(NP-40, preferably as an admixture with a concentration of 0.004-1%, and most preferably in 
admixture with Tween 20 at a working concentration of 0. 1% for standard compositions and 0.02% 
for laiBe sequence compositions) may be added to the buffer solution. This detergent is preferably 
5 added prior to the introduction of the remaining components into the solution, although the detergent 
may equivalently be added at any step of formulation. FoUowing formulation, the buffered salt 
solutions may be filtered through a low protein-binding filter unit that is available commercially (for 
example from Millipore Corporation, Bedford, Massachusetts) and stored until use. 

The remaining components are then added to the solution to formulate the compositions of 

1 0 the present invention. At least one thermostable enzyme (e.g., DNA polymerase) is added and the 
solution is gently mixed (to minimize protein denaturation). For standard DNA amplification 
(including via PCR) or sequencing of DNA segments up to about 5-6 kilobases in length, any 
thomostable DNA polymerase (hereinafter the "primary polymerase") may be used in the standard 
compositions, although Tag, Tne, Tma, VENT™, DEEPVENT™, PJu or Pwo polymerases are 

1 5 preferable at a working concentration in the solution of about 0. 1 -200 units per milliliter, about 0. 1 - 
50 units per milliliter, about 0.1-40 units per milliliter, about 0.1-36 units per milliliter, about 0.1-34 
units per milliliter, about 0. 1-32 units per milliliter, about 0. 1-30 units per milliliter, or about 0. 1-20 
units per milliliter, and most preferably at a working concentration of about 20 units per milliliter. 
For amplification of DNA segments larger than 5-6 kilobases in length, large sequence compositions 

20 should be formulated by adding to the standard compositions a low concentration of one or more 
additional thoroostable DNA polymerases (hermafto- the "secondary polymerase") containing a 3'-5' 
exonuclease activity. Particularly suited for this appfication are VENT™, Pfit, Pwo or Tne, and most 
pnefenibly DEEPVENT™, DNA polymerases. The additional poIymerase(s) should be added to the 
solution in suffident quantity to give a final woricing concentration of about 0.0002-200 units per 

25 millifiter, about 0.002-100 units per milliliter, about 0.002-20 units per milliliter, about 0.002-2.0 
units per miUiliter, about 0.002-1.6 units per milliliter, about 0.002-0.8 units per milliliter, about 
0.002-0.4 units per milliliter, or about 0.002-0.2 units per milHliter, most preferably at concentrations 
of about 0.40 units per milliliter. 

It has heretofore been thought that the activity ratios of the primary to secondary polymerases 

30 should be maintaied at 4:1 -2000:1 for large sequence amplification (j^e U.S. Patent No. 5,436,149). 
It has now been discovered, however, that in the compositions of the present invemion that activity 
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ratiosoftheprimarytosecondary polymerases of 1:1, 1:2, 1:4, 1:5, 1:8, 1:10, 1:25, 1:50, 1 100 
1 :250, 1 :500. 1 : 1 000 and 1 :2000 may be suitable for amplification of large nucleotide sequences. 

For nucleic acid sequencing, the reagent compositions may be used as formulated above. For 
nucleic acid sequencing by the dideoxy method (See U.S. Patent Nos. 4,962,020, 5,173,411 and 
5 5,498,523), howevCT, preferably the mutant Trte DNA polymerase shown in SEQ ID N0:2 is added 
to the reagent compositions. Tm polymerase is added to the solution to give a working 
concentration of about 0.1-10,000 units per milliliter, about 0.1-5000 units per milliliter, about 0.1- 
2500 units per milliliter, about 0. 1-2000 units per milliliter, about 0. 1 -1 500 units per milliliter, about 
O.MOOO units per milliliter, about 0. 1-500 units per milliliter, about 0. 1-300 units per milhliter, about 

10 0. 1.200 units per milliliter, about 0. 1-100 units per milliliter, or about 0, 1-50 units per milliliter, and 
most preferably of about 300 units per milliliter. 

For dideoxy sequencing, a solution of each ddNTP is also prepared. The base of each 
solution contains dATP, dCTP, dTTP, 7-deaza-GTP and/or other dNTPs, each at a working 
concentration of about 10-1000 micromolar, about 10-500 micromolar, about 10-250 micromolar, 

15 or about 10-100 micromolar, most preferably at a concentration of about 100 micromolar, in a 
solution of buffer and chelating salts, for example TRIS®-HCI most preferably at a working 
concentration of about 10 millimolar (pH about 7.5) and disodium-EDTA most preferably at a 
concentration of about 0.1 millimolar. To this base, one of the ddNTPs is added to make each of 
four solutions. Preferably, the sodium or lithium salt of ddATP, ddCTP, ddGTP or ddTTP is added 

20 to the solution to pve a woridng concoitration of the ddNTP of about 0.5-10 micromolar, about 0.5- 
8 miax)molar, about 0.5-5 micromolar, about 0.5-3 micromolar, about 0.5-2.5 micromolar, or about 
0.5-2 miCTomolar, and most preferably about 2 nucromolar. For cycle sequencing applications, the 
pH of the ddNTP solutions will preferably be about 9.0, and the concentrations of ddNTPs may be 
lowCT, preferably about 0.05 to 1.0 mi<Tomolar or about 0.05 to 0.8 micromolar, and most preferably 

25 about 0.08 to 0.8 micromolar. For some applications, it may be desirable to also incorporate or 
substitute ddlTP, ddUTP, and/or a-thio-dATP into the compositions at approximately the same 
woridng concentrations. Thus, four solutions are prepared, each containing one of the four ddNTPs, 
which are combined with the polymerase compositions of the present invention to carry out the four 
separate reactions used in dideoxy sequencing. Alternatively, for single-solution sequencing as 

30 disclosed in U.S. Patent Nos. 4,962.020 and 5,173,41 1, the four ddNTPs may be combined into a 
single solution which is added to the polymerase compositions of the present invention to perform 
the sequencing reaction. 
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For nucleic acid amplification, including PGR, dNTP salts are added to the reagent 
conjpoations. Preferably, the sodium or lithium salts of dATP, dCTP, dGTP and dTTP are added 
to the solution to give a working concentration of each dNTP of 10-1000 micromolar, preferably 
200-300 micromolar, and most preferably about 200 micromolar. For some applications, it may be 
5 desirable to also incorporate or substitute dITP or dUTP into the compositions at the same working 
concentrations. 

To reduce component denaturation, storage of the reagent compositions is preferably in 
conditions of diminished Ught, e.g., in amber or otherwise opaque containers or in storage areas with 
controUed low lighting. The ready-to-use reagent compositions of the present invention are 

1 0 unexpectedly stable at ambient temperature (about 20 "-IS C) for about 4- 1 0 weeks, are stable for 
at least one year upon storage at 4'C, and for at least two years upon storage at -20°C. 
Surpriangly, storage of the compositions at temperatures below freezing (e.^., -20''C to -lO'C), as 
is traditional with stock solutions of bioactive components, is not necessary to maintain the stability 
of the compositions of the present invention. 

15 In other preferred embodiments, the compositions of the present invention may be assembled 

into kits for use in nucldc acid amplification or sequencing. Sequencing kits according to the present 
invaition comprise a carrier means, such as a box, carton, tube or the like, having in close 
confinement th«-ein one or more container means, such as vials, tubes, ampules, bottles and Uie like, 
wherein a first container means contains a stable composition comprising a mixture of reagents, at 

20 woricing concentrations, which are at least one tiiermostable DNA polymerase, at least one buffer 
salt, at least one deoxynudeoside triphosphate and at least one dideoxynucleoside triphosphate. The 
sequencing kits may fiirtho- comprise additional reagents and compounds necessary for carrying out 
standard nucleic sequencing protocols, such as pyrophosphatase, agarose or polyaoylamide media 
for formulating sequencing gels, and other components necessary for detection of sequenced nucleic 

25 acids {See U.S. Patent Nos. 4,962,020 and 5,498,523, which are directed to methods of DNA 
sequencing). 

SimOariy, amplification kits according to the present invention comprise carrier means, such 
as a box, carton, tube or the like, having in close confinement therein one or mon container means, 
such as vials, tubes, ampules, bottles and the Uke, wherein a first container means contains a stable 
30 composition comprising a mixture of reagents, at working concentrations, which are at least one 
thermostable DNA polymerase, at least one buffer salt, and at least one deoxynudeoside 
triphosphate. The amplification kits encompassed by tiiis aspect of the present invention may fijrther 
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comprise additional reagents and compounds necessary for carrying out standard nucleic 
amplification protocols {See U.S. Patent Nos. 4,683,195 and 4,683,202, which are directed to 
methods of DNA amplification by PGR). 

5 Use of the Reagent Compositions 

The compositions and kits embodied in the present invention will have general utility in any 
application utiliang nucleic add sequencing or amplification methodologies. Amplification methods 
in which the present compositions may be used include PGR (U.S. Patent Nos. 4,683,195 and 
4,683,202), Strand Displacement Amplification (SDA; U.S. Patent No. 5,455,166; EP 0 684 315), 

10 and Nucldc Acid Sequence-Based Amplification (NASBA; U.S. Patent No. 5,409,81 8; EP 0 329 
822). Nucleic acid sequencing techniques which may employ the present compositions include 
dideoxy sequencing methods such as those disclosed in U.S. Patent Nos. 4,962,022 and 5,498,523, 
as well as more complex PCR-based nucleic acid fingerprinting techniques such as Random 
Amplified Polymorphic DNA (RAPD) analysis (Williams, J.G.K.. et ai, NucL Acids Res. 

15 I8(22):653l^535, 1990), Arbitrarily Primed PGR (AP-PGR; Welsh, J., and McGleUand, M., NucL 
Adds Res, 18(24)mum\%, 1990), DNA Amplification Fingerprinting (DAF; Caetano-Anolles et 
al, Bio/Technology 9:553-557, 1991), miaosateliite PGR or Directed Amplification of Minisatellite- 
region DNA (DAMD; Heath, D.D., et aL, NucL Acids Res. 21(24): 5782-5785, 1993), and 
Amplification Fragment Length Polymorphism (AFLP) analysis (EP 0 534 858; Vos, P., et aL, NucL 

20 Acids Res. 23(2J)A407'44\4, 1995; Lin, J.J., and Kuo, J., FOCUS J7(2):66-70, 1995). In 
particular, the compositions and kits of the present invention will be usefijl in the fields of medical 
therapeutics and diagnostics, forensics, and agricultural and other biolo^cal sciences, in any 
procedure utilimig thermostable DNA polymerases. Furthermore, the methods by vMch the 
compoations of the present invention are formulated may be extendable to all thermostable enzymes 

25 or mixtures thereof, and may allow the formulation of ready-to-use compositions of a variety of 
bioactive enzymes or other proteins that demonstrate increased stability upon extended storage at 
temperatures above fi*eezing. 

The compositions and kits of the invention are particularly usefiil in methods for amplifying 
and sequencing nucleic acid molecules. Nucleic acid amplification methods according to this aspect 

30 of the invention comprise contacting a nucleic acid molecule to be amplified with one or more of the 
compositions of the invention, thus providing a population of amplified copies of the nucleic acid 
molecule. Nucleic acid sequencing methods according to this aspect of the invention comprise 
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contacting the nucleic acid molecule to be sequenced with one or more of the compositions of the 
invention. According to these methods, amplification and sequencing of the nucleic acid molecule 
may be accomplished by any of the above-described amplification and sequencing techniques, most 
preferably by PGR. The present amplification and sequencing methods are particularly useful for 
amplification and sequencing of large nucleic acid molecules (e.g., by "long PGR"), preferably nucleic 
acid molecules that are larger than about 4-8 kilobases in size, more preferably larger than about 5-7 
kilobases in size, and most preferably nucleic acid molecules that are larger than about 7 kilobases 
in size. 



It will be readily apparent to those of ordinary skill in the relevant arts that other suitable 
modifications and adaptations to the methods and applications described herein are obvious and may 
be made without departing fi-om the scope of the invention or any embodiment thereof Having now 
described the present invention in detail, the same will be more clearly understood by reference to 
the following examples, which are included herewith for purposes of illustration only and are not 
intended to be limiting of the invention. 



Example 1: Formulation of Standard Compositions 

As an initial step in formulating stable, ready-to-use reagent compositions for nucleic acid 
amplification and sequencing, components were mixed in varying amounts as shown in Table 1 to 
provide 24 different formulations. The pH on all formulations was adjusted to about 8.3. After 
filtration through a low protein-binding filter, all compositions were formulated with 20 units/ml of 
Taq polymerase and were suitable for use as standard compositions for amplification or sequencing 
of nucleic acid fragments up to about 5-6 kilobases in size. 
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Example 2: Stability of Standard Comp sitions 



To examine the stability of the standard compositions formulated in Example 1, samples of 
each formulation were aliquoted and stored, under diminshed light, at ambient temperature (about 
5 20''-25*'C), 4°C, -20°C and -70°C. Samples of each formulation from each temperature were taken 
daily for the first week, and weekly thereafter, and used in stability assays. These stability assays 
were performed by amplifying, via standard PGR, suboptimal amounts of human genomic DNA as 
a template uang a template titration (if few samples were to be compared in a time point) or a fixed 
amount of template (if a larger number of samples were to be compared). To the desired amount of 

1 0 template DNA in a given formulation, 10 picomoles of primer was added, and the reaction mixtures 
were subjected to 35 cycles of PGR of 30 seconds at 94"C, 30 seconds at 55**G and 1 
minute/kilobase at 72**C. A portion of each reaction was then subjected to agarose gel 
electrophoresis and visualized by ethidium bromide fluorescence under ultraviolet illumination. 

The results of these assays indicated that certain of the formulations demonstrated enhanced 

15 stability upon storage. As shown in Figure 1, after about three months of storage at 4**G, 
formulations 15 and 19 had completely lost enzymatic activity, as evidenced by an absence of bands 
in the lanes corresponding to these samples. At this same time point, however, formulations 10-14, 
16-18 and 20-24 demonstrated about the same levels of enzymatic activity as a freshly made 
("control") formulation. 

20 Storage temperature was also found to have a significant effect upon the stability of the 

formulations, even within a given formulation. As shown in Figure 2, when samples of formulation 
4 were examined after about 15 weeks of storage at dther 4*C or about 20-25"^ the samples stored 
at 4*'C reUined fiill enzymatic activity when compared to a control sample. Those stored at about 
20-25 "G, however, had lost some activity, as indicated by the lower yields of the target fragment 

25 obtained at all template concentrations. 

The results for all of the fomiulations at the various storage temperatures are summarized in 
Table 2. 
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These results indicate that several of the compositions unexpectedly maintained enzymatic activity 
for 6-12 weeks upon storage at 20-25 ^C, and for over one year at 4°C. Formulation 19, however, 
had completely lost activity within 24 hours of formulation. Formulation 1 5 also exhibited a rapid 
loss of activity. 

For further analysis of the stability of these ready-to-use compositions, several formulations 
were stored at 20-25 **C or at 4°C for up to six months, with samples taken monthly for stability 
assays performed by a determination of polymerase unit activity. The results of these assays are 
summarized in Tables 3 and 4. 



Table 3. Stability of Standard CoMPosmoNs 
(Percentage of Enzyme AcrivrrY Remaining) 
Upon Storage at 20-25**C 



20 



Formulation No.* 


1 Month 


2 Months 


3 Months 


4 Months 


ID 


89 


65 


nd^ 


nd 


11 


106 


3 


nd 


nd 


12 


106 


91 


nd 


nd 


13 


93 


72 


nd 


nd 


14 


91 


76 


nd 


nd 


15 


78 


63 


nd 


nd 


16 


94 


85 


nd 


nd 


17 


89 


90 


nd 


nd 


18 


90 


84 


nd 


nd 


19 


0 


0 


nd 


nd 


20 


88 


81 


72 


77 


21 


nd 


103 


nd 


nd 


22 


97 


84 


nd 


nd 


23 


83 


77 


nd 


nd 


24 


81 


106 


nd 


nd 



^ Formulation numbers correspond to those in Table 1 . 
^ nd = not done 
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Table 4, Stability of Standard Compositions 
(Percentage of Enzyme Acrrvrry Remaining) 
Upon Storage at 4°C 



10 



15 



20 



Formulation No.' 


1 Month 


2 Months 


4 Months 


5 Months 


6 Months 


10 


84 


86 


98 


nd* 


105 


u 


94 


97 


98 


nd 


nd 


12 


94 


97 


106 


nd 


nd 


13 


86 


93 


93 


nd 


nd 


14 


85 


91 


105 


nd 


nd 


15 


89 


89 


98 


nd 


nd 


16 


88 


103 


104 


nd 


nd 


i? 


83 


94 


91 


nd 


nd 


18 


90 


97 


99 


nd 


nd 


19 


95 


59 


nd 


nd 


38 


20 


97 


100 


94 


94 


103 ^ 


21 


93 


97 


100 


nd 


nd 


22 


100 


109 


35 


nd 


nd 


23 


94 


97 


89 


nd 


107 


24 


93 


94 


97 


nd 


nd 



' Formulation numbers correspond to those in Table I. 



^ nd - not done 

25 

Several of the fonnulations were stable upon storage at 20-25"C, most notable formulation 
20 which retained >70% of its enzymatic activity even after storage for four months at this 
temperature. As described earlier, formulation 19 had completely lost activity within the first month, 
as determined by the PCR assay. Interestingly, however, as determined by the polymerase unit 
30 activity assay, formulation 19 was stable for one month when stored at 4''C but had lost substantial 
activity by the second month of storage at this temperature (Table 4). Formulation 20. shown 
previously to be stable upon extended storage at 20-25 °C. was stable upon storage at 4°C for at 
least sbc months. 
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Taken together, these results indicate that the compositions of the present invention are 
readily suitable for use in nucleic acid amplification reactions, and demonstrate extended stability 
upon storage at 20-25°C or 4°C. 

5 Example 3: Formulation and Stability of Large Sequence Compositions 

For use in amplification and sequencing of nucleic acid fragments larger than 5-6 kilobases, 
it has been suggested as described above that a mixture of Tag and VENT™ or DEEPVENT™ 
polymerases (U.S. Patent No. 5,436,149; Barnes, Id), or ofTth and either Tli, Pyococcus or Tma 

10 (U.S. Patent No. 5.512,462), be used. Accordingly, Tag and DEEPVENT™ DNA polymerases 
were formulated, at activity ratios of 100:1 (for samples 1-3) or 50:1 (for sample 4) into solutions 
containing the buffer salts, cofactors and detergents shown in Table 5. Each of these fomiulations 
was adjusted to about pH 9. 1, which is optimal for the activity of DEEPVENT™ DNA polymerase 
(Bq and Mahbubani, Id). Samples were then stored at about 20-25'*C or at 4"'C and assayed weekly 

15 for 12 weeks, and monthly thereafter, in stability assays in which a 20 kilobase target in 100 
nanograms of human genomic template was amplified by PCR. Reaction mixtures included 10 
picomoles of primer and were subjected to 35 PCR cycles of 30 seconds at 94°C, 30 seconds at 
62''C and 1 minute/kilobase at GZ'C. Portions of the reaction were subjected to agarose gel 
electrophorKis and were visualized by ethidium bromide fluorescence under ultraviolet illumination, 

20 as shown above for Figures 1 and 2. Results are summarized in Table 5. 
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Table 5. Stability of Large Sequence Compositions 



Formulation No. 




Stability at: 


Formulation 


20-25«C 


4°C 




66mMTris-SO, (pH9.l) 








l9,8mM(NH,)j SO, 








2.2inMMgSO, 


< 12 weeks 


1 1 months 


1 


22 units/ml Tag DNA Polymerase 




0.22 units/ml DEEPVENT DNA Polymerase 








O.liyoTweeD-20 








0.011% NP-40 








66 mMTris-SO,(pH 9.1) 








l9.8mM(NHJj SO, 








2.2mMMgSO, 


< 1 2 weeks 


>I I months 


2 


24.42 units/ml Tag DNA Polymerase 




0.242 unils/ml DEEPVENT DNA Polymerase 








0.066% Tween-20 








0.066% NP^O 








66mMTris-S04 (pH9.l) 








19.8mM(NH,), SO, 








2.2mMMgSO, 




1 1 months 


3 


22 units/ml Tag DNA Polymerase 






0.22 miitsAnl DEEPVENT DNA Polymerase 








0.01%Twccn-20 








0.01% NP-40 








66mMTris-S04(pH9.1) 








l9.8mM(NH,),S0, 








2.2raMMgSO, 






4 


22 units/ml Tag DNA Polymerase 




1 1 months 




0.44 units^al DEEPVENT DNA Polymerase 








0.01%Twecn-20 








0.01% NP-40 







* nd = not done 
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Upon storage at ambient temperature (ZO'^-ZSX), aD of the formulations were stable for 6-12 
weeks. Storage of these formulations at 4°C provided enhanced stability of over 1 1 months. Similar 
results may be obtained with formulations in which Taq and Tm DN A polymerases were used in an 
activity ratio of 1:1, 1:2, 1:4, 1:5, 1:8, 1:10, 1:25, 1:50, 1:100, 1:250, 1:500, 1:1000 or 1:2000. 
5 These results indicate that the large sequence compositions of the present invention are readily 
suitable for use in amplification of nucleic acid sequences larger than 5-6 kilobases and demonstrate 
extended stability upon storage at 20° to 25X, or at 4°C. 

Example 4: Combinations of Thermus flavis (T/l) DNA Polymerase and Thermotoga 
10 neapolitana (Tne) DNA Polymerase 

To examine other DNA polymerase compositions for their utility in amplification of nucleic 
add molecules, a mixture of Tfl and Tne DNA polymerases, at a 1 : 1 ratio, was used to amplify the 
2.7 kilobase Pucl9 plasmid. Amplification reactions were in a 50 pi final volume in buffer containing 

15 I mM magnesium acetate. 80 pg of Pucl9 linearized by treatment with AMI was used as the 
template, and was contacted with 1 pi of enzyme mixture. PCR conditions were 1 min at 94**C, 
followed by 35 cycles of 94**C for 30 seconds/60*C for 30 seconds/68**C for 5 minutes. 

Upon analysis of the amplification products by gel electrophoresis, this composition 
comprising Tfl and Tne DNA polymerases was found to efficiently amplify the 2.7 kilobase Pucl9 

20 plasmid. The efl5ciency of amplification was comparable to amplification of Pucl9 using 1 pi of Taq 
DNA polymerase. 

Example 5: Amplification of Genomic DNA Using TfllTne Compositions 

25 

Having demonstrated that compositions comprising Tfl and Tne DNA polymerases efficiently 
amplify plasmid-sized nucleic acid molecules, these compositions were examined for their ability to 
amplify nucleic acid molecules from genomic DNA templates. Six different primer sets were 
constructed (ranging in size finom 0.25 to 4.1 kilobases) and used to amplify the human p-globin gene 
30 from a genomic DNA template from the K562 human leukemia cell line. Each reaction was 
performed in a volume of 50 pi comprising template at 40 ng/reaction. and Tfl/Tm mixture at either 
0.5 unit/reaction or 1 unit/reaction (7/7 and The at a 1:1 ratio in both mixtures). PCR conditions 
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10 



were 1 min at 94°C. foUowed by 35 cycles of 94'C for 30 seconds/55 °C for 30 second/eg-C for 5 
minutes. 

Upon analysis of the amplification products by gel electrophoresis, efficient amplification was 
observed for all primers. The sizes of the amplification products produced using the different primers 
were 0.25 kilobase. 0.7 kilobase, 1.1 kilobases. 2.0 kilobases and 4.1 kilobases. These resuks 
demonstrate that the TfllTne compositions efficiently amplify nucleic acid molecules derived fi^om 
genomic DNA templates. 

Example 6: Use of Various Enzyme Ratios in DNA Polymerase Compositions 



f 

To determine the efficacy of different enzyme ratios in nucleic acid amplification, a 
composition comprising a higher amount of Tne {V exo+) DNA polymerase than Tfl (3' exo-) DNA 
polymerase was made and tested for its ability to amplify Pucl9 and the p-globin gene. A 1:3 
mfacture of TJlfTne was made and used to amplify Pucl9 under the conditions described in Example 

15 4, and P-globin under the conditions described in Example 5. 

Upon analysis of the amplification products by gel electrophoresis, efficient amplification was 
observed for both Pucl9 and for p-globin (all template sizes). These resuhs demonstrate that 
compositions in which a 3' exo+ DNA polymerase {Tne) is present in higher quantity than a 3* exo- 
DNA polymerase (Tfl) efficiently amplify nucleic acid molecules derived from plasmid and genomic 

20 DNA templates. 



Example 7: Amplirication of Large Nucleic Add Molecules Using DNA Polymerase 
Mixtures 

25 

Having demonstrated that compositions comprising mixtures of 3' exo+ and 3' exo- DNA 
polymerases efficiently amplify plasmid-sized and small genomic nucleic acid molecules, such 
compositions were examined for their ability to amplify larger nucleic acid molecules. For these 
studies, mfartures of Tag (3* exo+) DNA polymerase and Tma (3' exo-) DNA polymerase (ULTma™; 
30 LTl, Gaithersburg, Maryland) were prepared. Two sets of mixtures were prepared: one set 
contained 1 unit of Tag and varying amounts of ULTma (0.3 unit, 0.6 unit, 0.8 unit or 1 unit), and 
the other set contained no Taq and only varying amounts of ULTma (0.3 unit, 0.6 unit, 0.8 unit or 
1 unit). These compositions were used to amplify 80 mg of human genomic DNA using primers 
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spedfic for a 7.5 kflobase region of the human P-globin gene. PCR conditions were 1 min at 94°C, 
followed by 35 cycles of 94X for 30 seconds/60X for 30 seconds/68°C for 5 minutes. 

Upon analysis of the amplification products by gel electrophoresis, the 7.5 kilobase p-globin 
fragment was found to be efficiently amplified by all compositions comprising both Taq and Tma 
5 DNA polymerase. Compositions comprising only Tma DNA polymerase, however, were unable to 
amplify this large fi-agment. In control experiments, compositions comprising only Taq DNA 
polymerase were also unable to amplify this large fi-agment. These results demonstrate that 
compositions comprising a mixture of 3' exo+ and 3' exo- DNA polymerases are usefijl in efficiently 
amplifying large nucleic acid molecules, particularly in amplifying nucleic acid molecules larger than 
10 about 7 kilobases in size. 

Example 8: Amplification ofLarge Nucleic Acid Molecules Using Various Ratios of Taq and 
Tma DNA Polymerases 

15 

Having demonstrated that compositions comprising mixtures of 3* exo+ {Taq) and 3' exo- 
(7ma) DNA polymerases efficiently amplify large nucleic acid molecules, compositions comprising 
these enzymes in various ratios were made and tested for their abilities to amplify the 7.5 kilobase 
P-globin gene Segment from Example 7. Two sets of mixtures were prepared: one set contained 1 

20 unit of ULTma DNA polymerase and different amounts (0.25, 0.5, 0.6, 0.7, 0,8, 0.9 or 1 .0 unit) of 
Taq DNA polymerase, and the other set contained 1 unit of Taq DNA polymerase and different 
amounts (1.0, 1.1, 1.2. 1.3, 1.4, 1,5, 2,0, 3.0, 4,0 or 6.0 units) of ULTma DNA polymerase. 
Amplification templates, primers and cycling conditions were as described for Example 7. 

Upon analysis of the amplification products by gel electrophoresis, efficient amplification of 

25 the 7 .5 kilobase P-globin gene was observed for the first set of mixtures only in those compositions 
containing 0.5 to 1 .0 unit of Taq DNA polymerase and 1 .0 unit of ULTma. Compositions in this set 
containing less than 0.5 unit of Taq DNA polymerase did not amplify this firagment. However, all 
en^e mbctures in the other set, /.c, compositions comprising 1 .0 unit of Taq DNA polymerase and 
1.0 to 6.0 units ofULTmaDNA polymerase, demonstrated efficient amplification of the 7.5 kilobase 

30 fi^agment. In separate experiments, a 13.5 kilobase fi^agment of the |J-globin gene was efficiently 
amplified using mixtures containing a 1 : 1 or a 1 :2 ratio of Taq to ULTma. Together, these results 
indicate that compositions in which Tma DNA polymerase is present in equal or higher quantity than 
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Taq DNA polymerase efficiently amplify large nucleic acid molecules, particularly those that are 
larger than about 7-13 kilobases in size. 

Example 9: Use of Taq and Tne DNA Polymerase Mixtures for Long PCR 

5 

To determine if other DNA polymerases could be used in compositions also comprising Taq 
DMA polymerase in amplification of large nucleic acid molecules, various mixtures of Taq and Tne 
DNA polymerases were made. For these experiments, a Tne DNA polymerase deletion mutant (5' 
exo-; 3' exo+) was mixed in amounts ranging from 0.05 to 2.0 units with 1 unit of Taq DNA 
1 0 polymerase and used to amplify the 7.5 kilobase p-globin fragment under conditions described for 
Example 7. 

Upon analysis of the amplification products by gel electrophoresis, all of the combinations 
of Tne and Taq DNA polymerases were found to efficiently amplify the 7.5 kilobase DNA fragment. 
These results indicate that compoations comprising combinations of Taq and Tne DNA polymerases 
1 5 are useful in amplifying large nucleic acid molecules, particularly those larger than about 7 kilobases 



m size. 



Having now fijlly described the present invention in some detail by way of illustration and 
example for purposes of clarity of understanding, it will be obvious to one of ordinary skill in the art 

20 that the same can be performed by modifying or chan^ng the invention within a wide and equivalent 
range of conditions, formulations and other parameters without affecting the scope of the invention 
or any specific embodiment thereof, and that such modifications or changes are intended to be 
encompassed within the scope of the appended claims. 

All publications, patents and patent applications mentioned in this specification are indicative 

25 of the level of skill of those skilled in the art to which this mvention pertains, and are herein 
incorporated by reference to the same extent as if each individual publication, patent or patent 
application was specifically and individually indicated to be incorporated by reference. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

(A) NAME: Life Technologies, Inc. 

(B) STREET: 9800 Medical Center Drive 

(C) CITY: Rockville 
10 (D) STATE: Maryland 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 20850 

(ii) TITLE OF INVENTION: Stable Compositions for Nucleic Acid 
15 Amplification and Sequencing 

(iii) NUMBER OF SEQUENCES: 2 

(iv) COMPUTER READABLE FORM: 
20 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

CD) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 

25 (vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/689,815 

(B) FILING DATE: 14 -AUG- 1996 

(vi) PRIOR APPLICATION DATA: 
30 (A) APPLICATION NUMBER: US 08/801,720 

(B) FILING DATE: 14 -FEB- 1997 



35 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 893 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
40 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



Met Ala Arg Leu Phe Leu Phe Asp Gly Thr Ala Leu Ala Tyr Arg Ala 
50 1 5 10 15 



Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Ser Thr Gly He Pro Thr 
20 25 30 

Asn Ala Val Tyr Gly Val Ala Arg Met Leu Val Lys Phe He Lys Glu 

35 40 45 
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10 



15 



25 



30 



40 



45 



55 



His He He Pro Glu Lys Asp Tyr Ala Ala Val Ala Phe Asp Lys Lvs 
50 55 60 

Ala Ala Thr Phe Arg His Lys Leu Leu Val Ser Asp Lye Ala Gin Arg 
65 70 75 80 

Pro Lys Thr Pro Ala Leu Leu Val Gin Gin Leu Pro Tyr He Lys Arg 
85 90 95 

Leu He Glu Ala Leu Gly Phe Lys Val Leu Glu Leu Glu Gly Tyr Glu 
100 105 110 

Ala Asp Asp He He Ala Thr Leu Ala Val Arg Ala Ala Arg Phe Leu 
115 120 125 

Met Arg Phe Ser Leu He Thr Gly Asp Lys Asp Met Leu Gin Leu Val 
130 135 140 



Asn Glu Lys He Lys Val Trp Arg He Val Lys Gly He Ser Asp Leu 
20 145 150 155 160 



Glu Leu Tyr Asp Ser Lys Lys Val Lys Glu Arg Tyr Gly Val Glu Pro 
165 170 175 

His Gin He Pro Asp Leu Leu Ala Leu Thr Gly Asp Asp He Asp Aen 
180 185 190 

He Pro Gly Val Thr Gly He Gly Glu Lys Thr Ala Val Gin Leu Leu 
195 200 205 

Gly Lys Tyr Arg Asn Leu Glu Tyr He Leu Glu His Ala Arg Glu Leu 
210 215 220 



Pro Gin Arg Val Arg Lys Ala Leu Leu Arg Asp Arg Glu Val Ala He 
35 225 230 235 240 



Leu Ser Lys Lys Leu Ala Thr Leu Val Thr Asn Ala Pro Val Glu Val 
245 250 255 

Asp Trp Glu Glu Met Lys Tyr Arg Gly Tyr Asp Lys Arg Lys Leu Leu 
260 265 270 

Pro He Leu Lys Glu Leu Glu Phe Ala Ser He Met Lys Glu Leu Gin 
275 280 285 

Leu Tyr Glu Glu Ala Glu Pro Thr Gly Tyr Glu He Val Lys Asp His 
290 295 300 



Lys Thr Phe Glu Asp Leu He Glu Lys Leu Lys Glu Val Pro Ser Phe 
50 305 310 315 320 



Ala Leu Asp Leu Glu Thr Ser Ser Leu Asp Pro Phe Asn Cys Glu He 
325 330 335 

Val Gly He Ser Val Ser Phe Lys Pro Lys Thr Ala Tyr Tyr He Pro 
340 345 350 
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Leu HiB His Arg Aen Ala His Asn Leu Asp Glu Thr Leu Val Leu Ser 
355 360 365 

Lys Leu Lys Glu lie Leu Glu Asp Pro Ser Ser Lys lie Val Gly Gin 
370 375 380 



10 



Asn Leu Lys Tyr Asp Tyr Lys Val Leu Met Val Lys Gly lie Ser Pro 

385 390 395 400 

Val Tyr Pro His Phe Asp Thr Met He Ala Ala Tyr Leu Leu Glu Pro 

405 410 415 



15 



Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser Leu Lys Phe Leu Gly 
420 425 430 

Tyr Lys Met Thr Ser Tyr Gin Glu Leu Met Ser Phe Ser Ser Pro Leu 
435 440 445 



20 



Phe Gly Phe Ser Phe Ala Asp Val Pro Val Asp Lys Ala Ala Glu Tyr 
450 455 460 



25 



Ser Cys Glu Asp Ala Asp He Thr Tyr Arg Leu Tyr Lys He Leu Ser 
465 470 475 480 

Met Lye Leu His Glu Ala Glu Leu Glu Asn Val Phe Tyr Arg He Glu 
485 490 495 



30 



Met Pro Leu Val Asn Val Leu Ala Arg Met Glu Phe Asn Trp Val Tyr 
500 505 510 

Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu Glu Tyr Gly Lys Lys 
515 520 525 



Leu Glu Glu Leu Ala Glu Lys He Tyr Gin He Ala Gly Glu Pro Phe 
35 530 535 540 



40 



45 



Asn He Asn Ser Pro Lys Gin Val Ser Asn He Leu Phe Glu Lys Leu 
545 550 555 560 

Gly He Lys Pro Arg Gly Lys Thr Thr Lys Thr Gly Asp Tyr Ser Thr 
565 570 575 

Arg He Glu Val Leu Glu Glu He Ala Asn Glu His Glu He Val Pro 
580 585 590 

Leu He Leu Glu Phe Arg Lys He Leu Lys Leu Lys Ser Thr Tyr He 
595 600 605 



50 



Asp Thr Leu Pro Lys Leu Val Asn Pro Lys Thr Gly Arg Phe His Ala 
610 615 620 



55 



Ser Phe His Gin Thr Gly Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 
"5 630 635 640 

Pro Asn Leu Gin Asn Leu Pro Thr Lys Ser Glu Glu Gly Lys Glu He 
645 650 655 
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Arg Lye Ala He Val Pro Gin Asp Pro Asp Trp Trp He Val Ser Ala 
660 665 €70 

Abp Tyr Ser Gin He Glu Leu Arg He Leu Ala His Leu Ser Gly Asp 
5 675 680 6B5 

Glu Asn Leu Val Lys Ala Phe Glu Glu Gly He Asp Val His Thr Leu 
690 695 700 

10 Thr Ala Ser Arg He Tyr Asn Val Lys Pro Glu Glu Val Asn Glu Glu 

705 710 715 720 



15 



30 



40 



45 



Met Arg Arg Val Gly Lys Met Val Asn Phe Ser He He Tyr Gly Val 
725 730 735 

Thr Pro Tyr Gly Leu Ser Val Arg Leu Gly He Pro Val Lys Glu Ala 
740 745 750 



Glu Lys Met He He Ser Tyr Phe Thr Leu Tyr Pro Lys Val Arg Ser 
20 755 760 765 

Tyr He Gin Gin Val Val Ala Glu Ala Lys Glu Lys Gly Tyr Val Arg 
770 775 780 

25 Thr Leu Phe Gly Arg Lys Arg Asp He Pro Gin Leu Met Ala Arg Asp 

785 790 795 800 

Lys Asn Thr Gin Ser Glu Gly Glu Arg He Ala He Asn Thr Pro He 
805 810 815 

Gin Gly Thr Ala Ala Asp He He Lys Leu Ala Met He Asp He Asp 
820 825 830 

Glu Glu Leu Arg Lys Arg Asn Met Lys Ser Arg Met He He Gin Val 
35 835 840 845 

His Asp Glu Leu Val Phe Glu Val Pro Asp Glu Glu Lys Glu Glu Leu 
850 855 860 



Val Asp Leu Val Lys Asn Lys Met Thr Asn Val Val Lys Leu Ser Val 
865 870 875 880 

Pro Leu Glu Val Asp He Ser He Gly Lys Ser Trp Ser 
885 890 

(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 893 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: peptide 

55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Arg Leu Phe Leu Phe Asp Gly Thr Ala Leu Ala Tyr Arg Ala 

Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Ser Thr Gly He Pro Thr 
20 25 30 

Asn Ala Val Tyr Gly Val Ala Arg Met Leu Val hys Phe He Lye Glu 
10 35 40 45 



15 



20 



30 



35 



45 



50 



55 



His He He Pro Glu Lys Asp Tyr Ala Ala Val Ala Phe Asp Lys Lys 
50 55 60 

Ala Ala Thr Phe Arg His Lys Leu Leu Val Ser Asp Lys Ala Gin Aro 
^5 70 75 ao 

Pro Lys Thr Pro Ala Leu Leu Val Gin Gin Leu Pro Tyr He Lys Arq 
85 90 95 

Leu He Glu Ala Leu Gly Phe Lys Val Leu Glu Leu Glu Gly Tyr Glu 
100 105 110 



Ala Asp Asp He He Ala Thr Leu Ala Val Arg Ala Ala Arg Phe Leu 
25 115 120 125 



Met Arg Phe Ser Leu He Thr Gly Asp Lys Asp Met Leu Gin Leu Val 
130 135 140 

Aan Glu Lys He Lys Val Trp Arg He Val Lys Gly He Ser Asp Leu 
150 155 160 

Glu Leu Tyr Asp Ser Lys Lys Val Lys Glu Arg Tyr Gly Val Glu Pro 
165 170 

His Gin He Pro Asp Leu Leu Ala Leu Thr Gly Asp Asp He Asp Asn 
180 185 190 



He Pro Gly Val Thr Gly He Gly Glu Lys Thr Ala Val Gin Leu Leu 
195 200 205 



Gly Lys Tyr Arg Asn Leu Glu Tyr He Leu Glu His Ala Arg Glu Leu 
210 215 220 

Pro Gin Arg Val Arg Lys Ala Leu Leu Arg Asp Arg Glu Val Ala He 
230 235 240 

Leu Ser Lys Lys Leu Ala Thr Leu Val Thr Asn Ala Pro Val Glu Val 
245 250 255 

Asp Trp Glu Glu Met Lys Tyr Arg Gly Tyr Asp Lys Arg Lys Leu Leu 
260 265 270 

Pro He Leu Lys Glu Leu Glu Phe Ala Ser He Met Lys Glu Leu Gin 
275 280 285 
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Leu Tyr Glu Glu Ala Glu Pro Thr Gly Tyr Glu He Val Lys Asp His 

290 295 300 

Lys Thr Phe Glu Asp Leu He Glu Lys Leu Lys Glu Val Pro Ser Phe 

305 310 315 320 



10 



Ala Leu Asp Leu Glu Thr Ser Ser Leu Asp Pro Phe Asn Cys Glu He 
325 330 335 

Val Gly He Ser Val Ser Phe Lys Pro Lys Thr Ala Tyr Tyr He Pro 
340 345 350 



15 



Leu His His Arg Asn Ala His Asn Leu Asp Glu Thr Leu Val Leu Ser 
355 360 365 

Lys Leu Lys Glu He Leu Glu Asp Pro Ser Ser Lys He Val Gly Gin 
370 375 380 



Asn Leu Lys Tyr Asp Tyr Lys Val Leu Met Val Lys Gly He Ser Pro 
20 385 390 395 400 



25 



Val Tyr Pro His Phe Asp Thr Met He Ala Ala Tyr Leu Leu Glu Pro 
405 410 415 

Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser Leu Lys Phe Leu Gly 
420 425 430 



30 



Tyr Lys Met Thr Ser Tyr Gin Glu Leu Met Ser Phe Ser Ser Pro Leu 
435 440 445 

Phe Gly Phe Ser Phe Ala Asp Val Pro Val Asp Lys Ala Ala Glu Tyr 

450 455 460 



Ser Cys Glu Asp Ala Asp He Thr Tyr Arg Leu Tyr Lys He Leu Ser 
35 465 470 475 480 



40 



Met Lye Leu His Glu Ala Glu Leu Glu Asn Val Phe Tyr Arg He Glu 
485 490 495 

Met Pro Leu Val Asn Val I,eu Ala Arg Met Glu Phe Asn Trp Val Tyr 
500 505 510 



45 



Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu Glu Tyr Gly Lys Lys 
515 520 525 

Leu Glu Glu Leu Ala Glu Lys He Tyr Gin He Ala Gly Glu Pro Phe 
530 535 540 



Asn He Asn Ser Pro Lys Gin Val Ser Asn He Leu Phe Glu Lys Leu 
50 545 550 555 560 



55 



Gly He Lys Pro Arg Gly Lys Thr Thr Lys Thr Gly Asp Tyr Ser Thr 
565 570 575 

Arg He Glu Val Leu Glu Glu He Ala Asn Glu His Glu He Val Pro 
580 585 590 
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10 



15 



25 



30 



55 



Leu He lieu Glu Phe Arg Lys He Leu Lys Leu Lys Ser Thr Tyr He 
595 600 605 

Asp Thr Leu Pro Lys Leu Val Asn Pro Lys Thr Gly Arg Phe His Ala 
610 615 620 

Ser Phe His Gin Thr Gly Thr Ala Thr Gly Arg Leu Ser Ser Ser Asd 
630 635 ^40 

Pro Asn Leu Gin Asn Leu Pro Thr Lys Ser Glu Glu Gly Lys Glu He 

650 655 

Arg Lys Ala He Val Pro Gin Aep Pro Asp Trp Trp He Val Ser Ala 

665 670 

Asp Tyr Ser Gin He Glu Leu Arg He Leu Ala His Leu Ser Gly Asp 
675 680 ias 



Glu Asn Leu Val Lys Ala Phe Glu Glu Gly He Asp Val His Thr Leu 
20 690 695 700 



Thr Ala Ser Arg He Tyr Asn Val Lys Pro Glu Glu Val Asn Glu Glu 
'^^^ 710 715 720 

Met Arg Arg Val Gly Lys Met Val Asn Tyr Ser He He Tyr Gly Val 
725 730 735 

liar Pro Tyr Gly Leu Ser Val Arg Leu Gly He Pro Val Lys Glu Ala 
740 745 750 

Glu Lys Met He He Ser Tyr Phe Thr Leu Tyr Pro Lys Val Arg Ser 
755 760 765 



Tyr He Gin Gin Val Val Ala Glu Ala Lys Glu Lys Gly Tyr Val Arg 
770 775 780 

Thr Leu Phe Gly Arg Lys Arg Asp He Pro Gin Leu Met Ala Arg Asp 
'^^^ 790 795 800 

40 Lys Asn Thr Gin Ser Glu Gly Glu Arg He Ala He Asn Thr Pro He 

805 810 815 

Gin Gly Thr Ala Ala Asp He He Lys Leu Ala Met He Asp He Asp 
45 «20 825 830 

Glu Glu Leu Arg Lys Arg Asn Met Lys Ser Arg Met He He Gin Val 
835 840 845 

His Asp Glu Leu Val Phe Glu Val Pro Asp Glu Glu Lys Glu Glu Leu 
850 855 860 



Val Asp Leu Val Lys Asn Lys Met Thr Asn Val Val Lys Leu Ser Val 

875 880 

Pro Leu Glu Val Asp He Ser He Gly Lys Ser Trp Ser 
885 890 
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What Is Claimed Is: 



I A stable composition comprising a mixture of reagents at working concentrations, 
wherein said reagents are at least one thermostable enzyme and at least one buffer salt. 

5 

2. A stable composition for nucleic acid amplification comprising a mixture of reagents, 
wherein said reagents are at least one thermostable DN A polymerase, at least one buffer salt and at 
least one deoxynucleoside triphosphate. 



10 

3 . A stable composition for nucleic acid sequencing comprising a mixture of reagents, 
wh^ein said reagents are at least one thermostable DMA polymerase, at least one deoxynucleoside 
triphosphate, at least one dideoxynucleoside triphosphate and at least one buffer salt. 

15 4. The stable composition of claim 2 or claim 3, wherein said reagents are present at 

working concentrations. 

5. The composition of claim 2 or claim 3, wherein said thermostable DNA polymerase 
is selected from the group of themiostable DNA polymerases consisting of a Taq DNA polymerase. 

20 a Tne DNA polymerase, a Tma DNA polymerase, and mutants thereof 

6. The compoation of claim 2 or claim 3, wherein said thermostable DNA polymerase 
is selected from the group of thermostable DNA polymerases consisting of a Pfu DNA polymerase, 
a Pwo DNA polymerase, VENT™ DNA polymerase, DEEPVENT™ DNA polymerase, and mutants 

25 thereof 



7. The composition of claim 5, wherein said mixture further comprises DEEPVENT™ 
DNA polymerase or VENT™ DNA polymerase. 

8. The composition of claim 5, wherein the concentration of Taq DNA polymerase or 
mutant thereof is about 0. 1 to 200 units per milliliter. 
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9. The composition of claim 8, v^erdn the concentration is about 20 units per milliliter 

10. The composition of claim 5, wherein the concentration of Tne DNA polymerase or 
mutant thereof is about 0.1 to 200 units per milliliter. 

5 

1 1 The composition of claim 10, wherein the concentration is about 20 urats per milliliter. 

12. The composition of claim 5, wherein the concentration of Tma DNA polymerase or 
mutant thereof is about 0. 1 to 200 units per milliliter. 

10 

1 3 . The composition of daim 1 2, wherein the concentration is about 20 units per milliliter. 

14. The composition of claim 6, wherein the concentration of VENT™ DNA polymerase 
or mutant thereof is about 0. 1 to 200 units per milliliter. 

15 

1 5. The composition of claim 14, wherein the concentration is about 20 units per milliliter. 

16. The composition of claim 6, wherein the concentration of DEEP VENT™ DNA 
polymerase or mutant thereof is about 0.1 to 200 units per milliliter 

20 

1 7. The composition of claim 1 6 vrfierdn the conc«itration is about 20 units per milliliter 

1 8. The composition of claim 6, wherein the concentration of PJu DNA polymerase or 
mutant thereof is about 0. 1 to 200 units per milliliter. 

25 

19. The composition of claim 1 8 wherein the concentration is about 20 units per milliliter. 

20. The compo^tion of claim 6, wherein the concentration oiPwo DNA polymerase or 
mutant thereof is about 0, 1 to 200 units per milliliter 

30 

21. The composition of daim 20 wherein the concentration is about 20 units per milliliter. 
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22. The composition of claim 7, wherein the concentration of DEEPVENT™ DNA 
polymerase or VENT DNA polymerase is about 0.002 to 200 units per milliliter. 

23. The composition of claim 22, wherein the concentration is about 0.40 units per 
milliliter. 



24. The composition of claim 2 or claim 3, wherein said DNA polymerase retains at least 
90% of the enzymatic activity for at least four weeks when stored at about 20°C to 25°C. 

25. The composition of claim 5, wherein said DNA polymerase retains at least 90% of 
the en2ymatic activity for at least one year when stored at about 4°C. 

26. The composition of claim 2 or claim 3, fiirther comprising a magnesium salt. 

27. The composition of claim 2 or claim 3, further comprising at least one nonionic 
detergent. 



28. The composition of claim 2 or claim 3, wherein the concentration of said 
deoxynucleoside triphosphate is about 200 to about 300 micromolar. 

29. The composition of claim 3, wherein the concentration of said dideoxynucleoside 
triphosphate is about 0.08 to about 5 micromolar. 



30. A nucleic acid amplification kit comprising one or more containers, wherein a first 
container contains a stable composition comprising a mbcture of reagents, wherein said reagents are 
at least one thermostable DNA polymarase, at least one buflFer salt, and at least one deoxynucleoside 
triphosphate. 



31. A nucleic acid sequencing kit comprising one or more containers, wherein a first 
container contains a stable composition comprising a mixture of reagents, wherein said reagents are 
at least one thermostable DNA polymerase, at least one buffer salt, at least one deoxynucleoside 
triphosphate and at least one dideoxynucleoside triphosphate. 
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32. The kit of claim 30 or 31, wherein said reagents are present at working 
concentrations. 



33. A method of amplifying a nucleic acid molecule comprising contacting said nucleic 
acid molecule with the composition of claim 2. 

34. A method of amplifying a nucleic acid molecule comprising contacting said nucleic 
acid molecule with a composition selected from the group consisting of; 

a composition comprising a thamostable 3* exo+ DNA polymerase and a thermostable 3* exo- 
DNA polymerase wherein the concentrations of said 3' exo+ DNA polymerase and of said 3' exo- 
DN A polymerase are equal, and 

a composition comprising a thenmostable 3* exo+ DNA polymerase and a thermostable 3' exo- 
DNA polymerase wherein the concentration of said 3' exo+ DNA polymerase is higher than the 
concentration of said 3' exo- DNA polymerase. 

35. A method of sequencing a nucleic acid molecule comprising contacting said nucleic 
acid molecule with the composition of claim 3. 

36. A method of sequencing a nucleic acid molecule comprising contacting said nucleic 
acid molecule with a composition selected from the group consisting of: 

a composition comprising a themiostable 3' exo+ DNA polymerase and a thermostable 3' exo- 
DNA polymerase wherein the concentrations of said 3* exo+ DNA polymerase and of said 3* exo- 
DNA polymerase are equal, and 

a composHion comprising a thermostable 3' exo+ DNA polymerase and a thermostable 3' exo- 
DNA polymerase wherein the concentration of said 3' exo+ DNA polymerase is higher than the 
concentration of said 3' exo- DNA polymerase. 

37. The method of any one of claims 33-36, wherein said nucleic acid molecule is larger 
than about 4 kilobases in size. 

38. The method of claim 37, wherein said nucleic acid molecule is larger than about 7 
kilobases in size. 
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39. The method of claim 38, wherein said nucleic acid molecule is larger than about 8 
kilobases in size. 

40. A nucleic acid molecule amplified by the method of claim 33 or claim 34. 

41 . The nucleic acid molecule of claim 40, wherein said nucleic acid molecule is larger 
than about 4 kilobases in size. 



42. The nucleic acid molecule of claim 41, wherein said nucleic acid molecule is larger 
10 than about 7 kilobases in size. 



43. The nucleic acid molecule of claim 41, wherein said nucleic acid molecule is larger 
than about 8 kilobases in size. 
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